Convergence of U-statistics for interacting particle systems

P. Del Moral, F. Patras, S. Rubenthaler
February 1, 2010 

Abstract 

The convergence of U-statistics has been intensively studied for estimators based
on families of i.i.d. random variables and variants of them. In most cases, the independence
assumption is crucial [Lee90, de99]. When dealing with Feynman-Kac and other
interacting particle systems of Monte Carlo type, one faces a new type of problem.
Namely, in a sample of N particles obtained through the corresponding algorithms,
the distributions of the particles are correlated, although any finite number of them
is asymptotically independent with respect to the total number N of particles. In the
present article, exploiting the fine asymptotics of particle systems, we prove convergence
theorems for U-statistics in this framework.

Keywords: interacting particle systems, Feynman-Kac models, U-statistics, fluctuations,
limit theorems.

Introduction 

The convergence of U-statistics has been intensively studied for estimators based on families
of i.i.d. random variables and variants of them. In most cases, the independence assumption
is crucial. When dealing with Feynman-Kac and other interacting particle systems of Monte
Carlo type, one faces a new type of problem. Namely, in a sample of N particles obtained
through the corresponding algorithms, the distributions of the particles are correlated,
although any finite number of them is asymptotically independent with respect to the total
number N of particles. It turns out (and showing this is the main contribution of the present
article) that this asymptotic independence is enough in practice to ensure the convergence
of U-statistics based on interacting particle systems. In the following, we therefore prove
the convergence of U-statistics for different particle systems under mild assumptions that
are satisfied by Feynman-Kac particle systems. The case of Bird and Nanbu systems
also fits in this framework and will be treated elsewhere by the third author [Rub09]. To
study the asymptotics of Feynman-Kac systems, whose properties are crucial to ensure the
convergence of the statistics, we will use a functional representation, as introduced in [De09]
in the framework of discrete interacting particle systems.

The article is organized as follows. To fix the notations and the general framework
of interacting particle systems, we first recall the (Feynman-Kac, continuous) interacting
particle model. The model first appeared in quantum physics, in the work of Feynman and
Kac in the 1940s and 1950s, as a way to encode the motion of a quantum particle evolving in a
potential (e.g. the interaction potential of a quantum field theory, viewed as a perturbation
of the free Hamiltonian) in terms of path-integral formulas. It was progressively realized that
interacting particle systems could be used in many different settings in probability
and statistics. A detailed list of the (still expanding) application areas of these models is
contained in [De04], to which we refer for further information. Recall simply, since we
focus here on its statistical features, that the model is mainly used, in applied statistics,
as a Bayesian nonlinear filtering model: the motion of the particles is driven by a diffusion
process and the potential encodes the likelihood of the states with respect to observations or
to some reference path.

We then study the associated empirical joint distributions of a finite number k of particles
and the convergence of these distributions in terms of the total number N of particles of
the system. This is closely related to our previous joint work [De09] on discrete Feynman-Kac
models, although the continuous hypothesis we use in the present article leads to some
simplification of the tricky combinatorics that showed up in the discrete framework.

We then turn to U-statistics for interacting particle systems and prove that, under mild
assumptions on the behavior of the system (satisfied e.g. by Feynman-Kac and Boltzmann
systems), several asymptotic normality properties hold.



1 Feynman-Kac particle systems 

Let us consider an $E$-valued Markov process $X_t$, where $E = \mathbb{R}^d$ (or an arbitrary metric
space), with a time-inhomogeneous infinitesimal generator $L_t$, continuous trajectories, and a
positive bounded potential function $V_t$, $0 \le V_t(x) \le V_\infty$. We assume that the distribution of
$X_0$ is $\gamma_0 = \eta_0$. Notice that these hypotheses are meaningful for most applications, but could
be adapted to more general ones, see [De04]. We are interested in the unnormalized
(resp. normalized) distribution flows $\gamma_t$ and $\eta_t$ that are solutions, for sufficiently regular test
functions $f$ and under appropriate regularity conditions, of the nonlinear equations:

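$$\frac{d}{dt}\gamma_t(f) = \gamma_t(L_t(f)) - \gamma_t(f V_t)$$

and

$$\frac{d}{dt}\eta_t(f) = \eta_t(L_t(f)) + \eta_t\big(f\,(\eta_t(V_t) - V_t)\big).$$

In terms of $X_t$, we have:

$$\gamma_t(f) = \mathbb{E}\Big(f(X_t)\exp\Big(-\int_0^t V_s(X_s)\,ds\Big)\Big), \qquad \eta_t(f) = \frac{\gamma_t(f)}{\gamma_t(1)}.$$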



1.1 Definitions and notations 

Let us first fix some notation. For $q \in \mathbb{N}^*$, we write $[q] := \{1, \dots, q\}$. For $q, N \in \mathbb{N}^*$,
we set $\langle q, N\rangle := \{s \in [N]^{[q]} : s \text{ injective}\}$. For $q$ even, we write $\mathcal{I}_q$ for the set of partitions of
$[q]$ in pairs. We have

$$\#\mathcal{I}_q = \frac{q!}{2^{q/2}\,(q/2)!}.$$
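The count of pair partitions can be checked by brute force. The following is a minimal sketch (the helper names `pair_partitions` and `count_formula` are ours, not the article's) that enumerates the partitions of $[q]$ into pairs and compares their number with the closed formula.

```python
from math import factorial


def pair_partitions(elements):
    """Yield every partition of `elements` (a list of even length) into unordered pairs."""
    if not elements:
        yield []
        return
    first, rest = elements[0], elements[1:]
    # Pair the first element with each possible partner, then recurse on what is left.
    for idx, partner in enumerate(rest):
        remaining = rest[:idx] + rest[idx + 1:]
        for tail in pair_partitions(remaining):
            yield [(first, partner)] + tail


def count_formula(q):
    """Closed form q! / (2^(q/2) (q/2)!) for even q."""
    return factorial(q) // (2 ** (q // 2) * factorial(q // 2))


for q in (2, 4, 6, 8):
    enumerated = sum(1 for _ in pair_partitions(list(range(1, q + 1))))
    print(q, enumerated, count_formula(q))
```

The recursion always pairs the smallest remaining element first, so each partition is produced exactly once.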

The set of smooth bounded (resp. smooth bounded symmetric) functions on $E^q$ is written
$\mathcal{B}_b(E^q)$ (resp. $\mathcal{B}_b^{sym}(E^q)$). We also write $\mathcal{B}_0^{sym}(E^q)$ for the set of symmetric functions:

$$\mathcal{B}_0^{sym}(E^q) := \Big\{F \in \mathcal{B}_b^{sym}(E^q) : \int_E F(x_1, \dots, x_q)\,\gamma_t(dx_q) = 0\Big\}.$$

Notice that the set of functions $\mathcal{B}_0^{sym}(E^q)$ depends on $t$, so that a better notation would be
$\mathcal{B}_{0,t}^{sym}(E^q)$. However, since in practice the abbreviated notation should not lead to confusion,
we decided not to emphasize this dependency, for notational simplicity. We write simply $\mathcal{B}_0(E)$
for the centered functions: $\mathcal{B}_0(E) := \{F \in \mathcal{B}_b(E) : \int_E F(x)\,\gamma_t(dx) = 0\}$.

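The empirical (possibly random) measure associated to a (possibly random) vector $x =
(x_1, \dots, x_N) \in E^N$ is given by

$$m(x) := \frac{1}{N}\sum_{i=1}^{N}\delta_{x_i}.$$

We have, for all $q \in \mathbb{N}^*$ and $F : E^q \to \mathbb{R}$,

$$m(x)^{\otimes q}(F) = \frac{1}{N^q}\sum_{s \in [N]^{[q]}} F(x_{s(1)}, \dots, x_{s(q)}).$$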



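We also consider the corresponding U-statistics:

$$m(x)^{\odot q}(F) = \frac{1}{(N)_q}\sum_{s \in \langle q, N\rangle} F(x_{s(1)}, \dots, x_{s(q)}) = \frac{1}{\binom{N}{q}}\sum_{1 \le i_1 < \dots < i_q \le N} (F)_{sym}(x_{i_1}, \dots, x_{i_q}),$$

where $(N)_q = \frac{N!}{(N-q)!} = \#\langle q, N\rangle$ and $(F)_{sym}(y_1, \dots, y_q) := \frac{1}{q!}\sum_{\sigma \in \mathcal{S}_q} F(y_{\sigma(1)}, \dots, y_{\sigma(q)})$,
with $\mathcal{S}_q$ the symmetric group of order $q$.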
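The equality between the average over injections and the average of the symmetrized function over subsets can be illustrated numerically. This is only a sketch: the function names and the test function `F` are hypothetical choices made for the example.

```python
from itertools import combinations, permutations
from math import comb, factorial, isclose


def u_stat_injections(x, F, q):
    """m(x)^{⊙q}(F): average of F over all injective maps s: [q] -> [N]."""
    n = len(x)
    falling = factorial(n) // factorial(n - q)  # (N)_q = #<q, N>
    total = sum(F(*(x[i] for i in s)) for s in permutations(range(n), q))
    return total / falling


def symmetrize(F, q):
    """(F)_sym: average of F over all permutations of its q arguments."""
    def F_sym(*y):
        return sum(F(*(y[i] for i in sigma)) for sigma in permutations(range(q))) / factorial(q)
    return F_sym


def u_stat_subsets(x, F, q):
    """Average of (F)_sym over all index subsets i_1 < ... < i_q."""
    n = len(x)
    F_sym = symmetrize(F, q)
    total = sum(F_sym(*(x[i] for i in c)) for c in combinations(range(n), q))
    return total / comb(n, q)


x = [0.5, 1.0, 2.0, 3.5, -1.0]
F = lambda a, b: a * b * b  # deliberately non-symmetric
print(u_stat_injections(x, F, 2), u_stat_subsets(x, F, 2))
```

Both computations enumerate all tuples, so they are only practical for small $N$ and $q$.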

The notion of differential for sequences of signed measures¹ will also be useful. The total
variation norm is written $\|\cdot\|_{tv}$, so that, for any linear operator $L$ on $\mathcal{B}_b(E^q)$,

$$\|L\|_{tv} := \sup_{f \in \mathcal{B}_b(E^q) :\, \|f\|_\infty \le 1} |L(f)|.$$

Let $(\Theta^N)_{N \ge 1}$ be a uniformly bounded sequence of measures on $E^q$, in the sense that
$\sup_{N \ge 1}\|\Theta^N\|_{tv} < \infty$. The sequence $\Theta^N$ is said to converge strongly to some measure $\Theta$, as
$N \uparrow \infty$, if and only if

$$\forall f \in \mathcal{B}_b(E^q), \qquad \lim_{N \to \infty}\Theta^N(f) = \Theta(f).$$

Definition 1.1. Let us assume that $\Theta^N$ converges strongly to $\Theta$. The discrete derivative of
the sequence $(\Theta^N)_{N \ge 1}$ is the sequence of measures $(\partial\Theta^N)_{N \ge 1}$ defined by

$$\partial\Theta^N := N\,[\Theta^N - \Theta].$$

We say that $\Theta^N$ is differentiable if $\partial\Theta^N$ is uniformly bounded and if it converges strongly
to some measure $\partial\Theta$, called simply the derivative of $\Theta^N$, as $N \uparrow \infty$.

The discrete derivative $\partial\Theta^N$ of a differentiable sequence can itself be differentiable. In
this situation, the derivative of the discrete derivative is called the second derivative of $\Theta^N$
and it is denoted by $\partial^2\Theta = \partial(\partial\Theta)$, and so on.

A sequence $\Theta^N$ that is differentiable up to order $(k+1)$ has the following representation

$$\Theta^N = \sum_{0 \le l \le k}\frac{1}{N^l}\,\partial^l\Theta + \frac{1}{N^{k+1}}\,\partial^{k+1}\Theta^N$$

with $\sup_{N \ge 1}\|\partial^{k+1}\Theta^N\|_{tv} < \infty$, and the convention $\partial^0\Theta = \Theta$, for $l = 0$.
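As a toy illustration of the discrete derivative, one may take measures on a one-point space, that is, plain numbers. The sketch below assumes a hypothetical sequence $\Theta^N = A + B/N + C/N^2$ (the coefficients are arbitrary) and checks the representation above with $k = 1$ in exact arithmetic.

```python
from fractions import Fraction

# Hypothetical coefficients: Theta^N = A + B/N + C/N^2, viewed as a measure on a one-point space.
A, B, C = Fraction(2), Fraction(-3), Fraction(5)


def theta_n(n):
    return A + B / n + C / n**2


def discrete_derivative(sequence, limit):
    """Return N -> N * (sequence(N) - limit), the discrete derivative of the sequence."""
    return lambda n: n * (sequence(n) - limit)


d_theta_n = discrete_derivative(theta_n, A)     # equals B + C/N, with strong limit B
d2_theta_n = discrete_derivative(d_theta_n, B)  # equals C for every N


def representation(n):
    """Theta + (1/N) dTheta + (1/N^2) d^2 Theta^N, i.e. the expansion with k = 1."""
    return A + B / n + d2_theta_n(n) / n**2


for n in (1, 10, 1000):
    print(n, theta_n(n) == representation(n), d_theta_n(n), d2_theta_n(n))
```

Here the remainder term $\partial^2\Theta^N$ is constant, so the uniform bound required in the representation holds trivially.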



1.2 A genetic particle model 

A particle system approximating the measures $\eta_t$ (and therefore also $\gamma_t$) is the following. At
$t = 0$, the random vector $\Xi_0 = (\xi_0^1, \dots, \xi_0^N)$ is a family of i.i.d. random variables distributed
according to $\gamma_0 = \eta_0$. Each entry $\xi_t^i$ of the vector $\Xi_t$ diffuses according to the generator $L_t$,
independently of the other entries. Each entry $\xi_t^i$ has an exponential clock of parameter $V_\infty$
(independent of all the other variables defined up to now). When the clock of $\xi^i$ rings, say
at $\tau$, it can

• jump to a randomly (and uniformly) chosen particle in the family, including itself,
with probability $V_\tau(\xi_\tau^i)/V_\infty$;

• stay where it is with probability $1 - V_\tau(\xi_\tau^i)/V_\infty$.

1 From now on, measure will have the more general meaning of signed measure.

Up to a renormalization of the jump type generators (which is convenient for our present
purposes), this is the model described in [De04, Sect. 1.5.2]. Notice that the law of $\xi_t^i$
depends on $N$, so that a more consistent notation for $\xi_t^i$ would be $\xi_t^{N,i}$; whenever we want to
emphasize the dependency on $N$, we will switch to this second notation. The corresponding
empirical² measures and empirical U-statistics are given (and related) by:

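$$\eta_t^N = m(\Xi_t) = m(\xi_t^1, \dots, \xi_t^N),$$

$$\gamma_t^N = \gamma_t^N(1)\cdot\eta_t^N, \quad\text{with}\quad \gamma_t^N(1) = \exp\Big(-\int_0^t \eta_s^N(V_s)\,ds\Big),$$

$$(\gamma_t^N)^{\odot q}(F) = \gamma_t^N(1)^q\cdot(\eta_t^N)^{\odot q}(F).$$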

These are the analogs in continuous time of the random measures and U-statistics defined
and studied in [De09] in a discrete time setting. We recall that $\eta_t^N$ and $\gamma_t^N$ are known to
converge as $N \to \infty$ to $\eta_t$ and $\gamma_t$ [De04].
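The dynamics of the genetic model can be sketched with a crude time discretization. Everything specific below is an assumption made for illustration and is not taken from the article: the state space is $\mathbb{R}$, the diffusion is a standard Brownian motion, $\eta_0$ is a standard Gaussian, the potential is a truncated quadratic, and each exponential clock is approximated by a Bernoulli trial of probability $V_\infty\,dt$ per time step (which requires $V_\infty\,dt \le 1$).

```python
import math
import random


def simulate_genetic_system(n_particles, t_final, dt, v_inf, potential, rng):
    """Euler-type sketch of the genetic particle model.

    Returns the final particle positions and gamma_t^N(1) = exp(-int_0^t eta_s^N(V_s) ds).
    """
    steps = int(round(t_final / dt))
    xs = [rng.gauss(0.0, 1.0) for _ in range(n_particles)]  # assumed eta_0 = N(0, 1)
    log_mass = 0.0
    for _ in range(steps):
        # Riemann-sum approximation of the integral of eta_s^N(V_s).
        log_mass -= dt * sum(potential(x) for x in xs) / n_particles
        # Independent diffusion of every particle (assumed Brownian motion).
        xs = [x + math.sqrt(dt) * rng.gauss(0.0, 1.0) for x in xs]
        snapshot = list(xs)
        for i in range(n_particles):
            # The clock of particle i rings with probability ~ v_inf * dt during this step.
            if rng.random() < v_inf * dt:
                # On a ring, jump with probability V(x_i)/V_inf to a uniformly chosen
                # particle (possibly itself); otherwise stay in place.
                if rng.random() < potential(snapshot[i]) / v_inf:
                    xs[i] = snapshot[rng.randrange(n_particles)]
    return xs, math.exp(log_mass)


V_INF = 1.0
potential = lambda x: min(x * x, V_INF)  # assumed potential, bounded by V_INF
particles, gamma_mass = simulate_genetic_system(200, 1.0, 0.01, V_INF, potential, random.Random(0))
print(len(particles), gamma_mass)
```

The empirical measure $\eta_t^N$ is then the uniform measure on `particles`, and `gamma_mass` approximates $\gamma_t^N(1)$; the discretization error of this sketch is not controlled here.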

In order to study the convergence of the (empirical) U-statistics $(\gamma_t^N)^{\odot q}$ and $(\eta_t^N)^{\odot q}$, we
are going to rewrite them by means of functional expansions (actually, Laurent series in the
parameter $N$). These expansions will allow us, later, to control the convergence and (among
others) to extend to particle systems the classical central limit theorems for U-statistics.
Deriving these expansions is the main purpose of the first part of the article, together with
first results of convergence.

For these purposes, it is useful to introduce an auxiliary $q$-particle system. The reasons
for its introduction will become clear later and stem from a backward analysis of Feynman-Kac
trajectories of families of $q$ particles.

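Definition 1.2. The auxiliary system of $q$ particles, $\hat\xi_t^1, \dots, \hat\xi_t^q$, is defined as follows: the
random variables $\hat\xi_0^1, \dots, \hat\xi_0^q$ are independent of law $\eta_0$. Moreover, the particles

$$\hat\xi_t^1, \dots, \hat\xi_t^q \text{ diffuse according to } L_t \tag{1.1}$$

(one more time, independently of one another) and undergo the following jumps. For any
$(i, j) \in [q]^2$, $i \ne j$, there is an exponential clock of parameter $V_\infty/N$ and a corresponding
Poisson point process $T_1^{(i,j)}, T_2^{(i,j)}, \dots$ of parameter $V_\infty/N$. The $T_1^{(i,j)}, T_2^{(i,j)}, \dots$ are named
the ringing times. At a ringing time $\tau \in \{T_1^{(i,j)}, T_2^{(i,j)}, \dots\}$,

$$\begin{cases}\hat\xi_\tau^i \leftarrow \hat\xi_\tau^j & \text{with proba. } \dfrac{V_\tau(\hat\xi_\tau^i)}{V_\infty},\\[2mm] \hat\xi_\tau^i \leftarrow \hat\xi_\tau^i & \text{with proba. } 1 - \dfrac{V_\tau(\hat\xi_\tau^i)}{V_\infty}.\end{cases} \tag{1.2}$$

The notation $\hat\xi_\tau^i \leftarrow \hat\xi_\tau^j$ means that $\hat\xi_\tau^i$ jumps to (or is substituted by) $\hat\xi_\tau^j$. When $q = N$,
the particle system $(\hat\xi_t^1, \dots, \hat\xi_t^N)_{t \ge 0}$ has the same law as $(\xi_t^1, \dots, \xi_t^N)_{t \ge 0}$.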



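We set, for an arbitrary $F \in \mathcal{B}_b(E^q)$,

$$F_e\big((\hat\xi_s^1, \dots, \hat\xi_s^q)_{0 \le s \le t}\big) = F(\hat\xi_t^1, \dots, \hat\xi_t^q)\exp\Big(-\int_0^t \big[V_s(\hat\xi_s^1) + \dots + V_s(\hat\xi_s^q)\big]\,ds\Big)$$

and

$$E_{t,k}(F) := \mathbb{E}\big(F_e((\hat\xi_s^1, \dots, \hat\xi_s^q)_{0 \le s \le t}) \,\big|\, k \text{ rings on } [0; t]\big).$$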



Notice, for further use, that $E_{t,k}(F)$ does not depend on $N$. This is because, as a consequence
of the general properties of Poisson point processes, conditionally on the hypothesis that there
are $k$ rings on $[0, t]$, the distribution of the ringing times is uniform on $[0, t]$ and therefore
independent of the parameter $V_\infty/N$; since the other parameters of the process ($L_t$ and the
jump probabilities $V_\tau(\cdot)/V_\infty$) are independent of $N$, the property follows.



2 The adjective empirical refers, in the present article, to any measure, process or statistics obtained from
a particle system approximation.

1.3 Expansion of the unnormalized measure

Let us write, for an arbitrary $F \in \mathcal{B}_b(E^q)$,

$$\mathbb{Q}^N_{t,q}(F) := \mathbb{E}\big((\gamma_t^N)^{\odot q}(F)\big).$$

We refer to $\mathbb{Q}^N_{t,q}$ as the empirical unnormalized measure associated to the particle system
$\xi_t^N$. Since the joint distribution of the sequence $\xi_t^i$ is invariant by permutations,
$\mathbb{Q}^N_{t,q}(F) = \mathbb{Q}^N_{t,q}(F_{sym})$, and we can assume without restriction that $F$ is a symmetric function.
The first question we address is the (exact) computation of the speed of convergence of the
empirical unnormalized measure $\mathbb{Q}^N_{t,q}$ to $\gamma_t^{\otimes q}$.

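Theorem 1.3. For $F \in \mathcal{B}_b^{sym}(E^q)$, we have the Laurent expansion

$$\mathbb{Q}^N_{t,q}(F) = \sum_{k \ge 0}\frac{(\Lambda t)^k e^{-\Lambda t}}{k!}\,E_{t,k}(F) \tag{1.3}$$

$$= \gamma_t^{\otimes q}(F) + \sum_{\substack{k, i \ge 0 \\ k + i \ge 1}} (-1)^i\,\frac{(\Lambda t)^{k+i}}{k!\,i!}\,E_{t,k}(F) \tag{1.4}$$

$$= \gamma_t^{\otimes q}(F) + \sum_{r \ge 1}\frac{1}{N^r}\sum_{k=0}^{r}\frac{(-1)^{r-k}\,(q(q-1)V_\infty t)^r}{k!\,(r-k)!}\,E_{t,k}(F), \tag{1.5}$$

where $\Lambda := q(q-1)\frac{V_\infty}{N}$. In particular, $\mathbb{Q}^N_{t,q}$ is differentiable up to any order with

$$\partial^r\mathbb{Q}_{t,q}(F) = \sum_{k=0}^{r}\frac{(-1)^{r-k}}{k!\,(r-k)!}\,(q(q-1)V_\infty t)^r\,E_{t,k}(F).$$

Notice in particular that, although the $E_{t,k}(F)$ depend on the choice of the upper bound
$V_\infty$ for the potential function $V$, the coefficients of the expansion do not (they are the
derivatives of $\mathbb{Q}_{t,q}$, which do not depend on $V_\infty$).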

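Proof of Theorem 1.3.

We have first, for $f \in \mathcal{B}_b(E)$,

$$\frac{d}{dt}\mathbb{E}\big(\eta_t^N(f)\big) = \mathbb{E}\big(\eta_t^N(L_t(f))\big) + \mathbb{E}\Big(\sum_{1 \le i, j \le N}\frac{V_t(\xi_t^i)\,\big(f(\xi_t^j) - f(\xi_t^i)\big)}{N^2}\Big)$$

and

$$\frac{d}{dt}\mathbb{E}\big(\gamma_t^N(f)\big) = \mathbb{E}\big(\gamma_t^N(L_t(f))\big) - \mathbb{E}\big(\eta_t^N(V_t)\,\gamma_t^N(f)\big) + \mathbb{E}\Big(\gamma_t^N(1)\sum_{1 \le i, j \le N}\frac{V_t(\xi_t^i)\,\big(f(\xi_t^j) - f(\xi_t^i)\big)}{N^2}\Big).$$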

For $F \in \mathcal{B}_b^{sym}(E^q)$, let us introduce the useful notation $F_i : E^{q-1} \to \mathcal{B}_b(E)$,

$$F_i(x_1, \dots, x_{q-1})(y) := F(x_1, \dots, x_{i-1}, y, x_i, \dots, x_{q-1}),$$

and let us extend $L_t$ to functions in $\mathcal{B}_b^{sym}$:

$$L_t(F)(x_1, \dots, x_q) := \sum_{i=1}^{q} L_t\big(F_i(x_1, \dots, x_{i-1}, x_{i+1}, \dots, x_q)\big)(x_i).$$

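We get:

$$\frac{d}{dt}\mathbb{E}\big((\gamma_t^N)^{\odot q}(F)\big) = \mathbb{E}\big((\gamma_t^N)^{\odot q}(L_t(F))\big) - \mathbb{E}\big(q\,\eta_t^N(V_t)\,(\gamma_t^N)^{\odot q}(F)\big)$$
$$\qquad + \mathbb{E}\Big(\frac{\gamma_t^N(1)^q}{(N)_q}\sum_{s \in \langle q, N\rangle}\sum_{i=1}^{q}\sum_{k=1}^{N}\frac{V_t(\xi_t^{s(i)})}{N}\Big[F(\xi_t^{s(1)}, \dots, \xi_t^{s(i-1)}, \xi_t^{k}, \xi_t^{s(i+1)}, \dots, \xi_t^{s(q)}) - F(\xi_t^{s(1)}, \dots, \xi_t^{s(q)})\Big]\Big). \tag{1.6}$$

The second term in the right hand side of (1.6) reads:

$$-\mathbb{E}\Big(\frac{q\,\gamma_t^N(1)^q}{N\,(N)_q}\sum_{s \in \langle q, N\rangle}\sum_{k=1}^{N} F(\xi_t^{s(1)}, \dots, \xi_t^{s(q)})\,V_t(\xi_t^{k})\Big)$$
$$= -\mathbb{E}\Big(\frac{q\,\gamma_t^N(1)^q}{N\,(N)_q}\sum_{s \in \langle q, N\rangle} F(\xi_t^{s(1)}, \dots, \xi_t^{s(q)})\big(V_t(\xi_t^{s(1)}) + \dots + V_t(\xi_t^{s(q)})\big)\Big)$$
$$\quad - \mathbb{E}\Big(\frac{q\,\gamma_t^N(1)^q}{N\,(N)_q}\sum_{s \in \langle q+1, N\rangle} F(\xi_t^{s(1)}, \dots, \xi_t^{s(q)})\,V_t(\xi_t^{s(q+1)})\Big).$$

Similarly, making use of the symmetry properties of $F$, the last term in the right hand
side of (1.6) reads:

$$\mathbb{E}\Big(\frac{q\,\gamma_t^N(1)^q}{N\,(N)_q}\sum_{s \in \langle q, N\rangle}\sum_{k=1}^{N} V_t(\xi_t^{s(q)})\,F(\xi_t^{s(1)}, \dots, \xi_t^{s(q-1)}, \xi_t^{k})\Big) - \mathbb{E}\Big(\frac{\gamma_t^N(1)^q}{(N)_q}\sum_{s \in \langle q, N\rangle}\big(V_t(\xi_t^{s(1)}) + \dots + V_t(\xi_t^{s(q)})\big)F(\xi_t^{s(1)}, \dots, \xi_t^{s(q)})\Big).$$

The first term in this last sum decomposes then into (making use once again of the
symmetry properties of $F$):

$$\mathbb{E}\Big(\frac{\gamma_t^N(1)^q}{N\,(N)_q}\sum_{s \in \langle q, N\rangle}\big(V_t(\xi_t^{s(1)}) + \dots + V_t(\xi_t^{s(q)})\big)F(\xi_t^{s(1)}, \dots, \xi_t^{s(q-1)}, \xi_t^{s(q)})\Big)$$
$$\quad + \mathbb{E}\Big(\frac{q\,\gamma_t^N(1)^q}{N\,(N)_q}\sum_{s \in \langle q, N\rangle}\sum_{k=1}^{q-1} V_t(\xi_t^{s(q)})\,F(\xi_t^{s(1)}, \dots, \xi_t^{s(q-1)}, \xi_t^{s(k)})\Big)$$
$$\quad + \mathbb{E}\Big(\frac{q\,\gamma_t^N(1)^q}{N\,(N)_q}\sum_{s \in \langle q+1, N\rangle} F(\xi_t^{s(1)}, \dots, \xi_t^{s(q)})\,V_t(\xi_t^{s(q+1)})\Big).$$

Reorganizing the summands in these expansions, we finally get that the two last terms
of (1.6) sum up to:

$$-\mathbb{E}\Big(\frac{\gamma_t^N(1)^q}{(N)_q}\sum_{s \in \langle q, N\rangle} F(\xi_t^{s(1)}, \dots, \xi_t^{s(q)})\big(V_t(\xi_t^{s(1)}) + \dots + V_t(\xi_t^{s(q)})\big)\Big)$$
$$\quad + \mathbb{E}\Big(\frac{\gamma_t^N(1)^q}{(N)_q}\,\frac{1}{N}\sum_{s \in \langle q, N\rangle}\sum_{i, r = 1}^{q} V_t(\xi_t^{s(i)})\Big[F(\xi_t^{s(1)}, \dots, \xi_t^{s(i-1)}, \xi_t^{s(r)}, \xi_t^{s(i+1)}, \dots, \xi_t^{s(q)}) - F(\xi_t^{s(1)}, \dots, \xi_t^{s(q)})\Big]\Big).$$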

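We set

$$F^{V_t}(x_1, \dots, x_q) := \big(V_t(x_1) + \dots + V_t(x_q)\big)F(x_1, \dots, x_q),$$

$$F_t^{(i,r)}(x_1, \dots, x_q) := V_t(x_i)\big(F(x_1, \dots, x_{i-1}, x_r, x_{i+1}, \dots, x_q) - F(x_1, \dots, x_q)\big).$$

Then the equations above give

$$\frac{d}{dt}\mathbb{Q}^N_{t,q}(F) = \mathbb{Q}^N_{t,q}(L_t(F)) - \mathbb{Q}^N_{t,q}(F^{V_t}) + \frac{1}{N}\sum_{1 \le i, r \le q}\mathbb{Q}^N_{t,q}\big(F_t^{(i,r)}\big).$$

And so

$$\mathbb{Q}^N_{t,q}(F) = \mathbb{E}\Big(F(\hat\xi_t^1, \dots, \hat\xi_t^q)\exp\Big(-\int_0^t \big[V_s(\hat\xi_s^1) + \dots + V_s(\hat\xi_s^q)\big]\,ds\Big)\Big). \tag{1.7}$$

We then obtain Equation (1.3) of the theorem by noticing that

$$\mathbb{P}(k \text{ rings on } [0, t]) = \frac{(\Lambda t)^k e^{-\Lambda t}}{k!}.$$

We obtain Equation (1.4) by developing the term $e^{-\Lambda t}$ and by noticing that

$$E_{t,0}(F) = \gamma_t^{\otimes q}(F).$$

□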
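The passage from the Poisson mixture to the series in powers of $1/N$ is a purely algebraic regrouping, which can be checked numerically. In the sketch below the values standing for $E_{t,k}(F)$ are arbitrary bounded numbers (an assumption made only for the test; they are not computed from any particle system), and both series are truncated.

```python
from math import cos, exp, factorial, isclose


def poisson_mixture(e, lam_t, terms=40):
    """sum_k (Lambda t)^k e^{-Lambda t} / k! * E_k, truncated after `terms` terms."""
    return sum(lam_t**k * exp(-lam_t) / factorial(k) * e(k) for k in range(terms))


def derivative_coefficient(e, c, r):
    """r-th coefficient: sum_{k<=r} (-1)^(r-k) c^r / (k! (r-k)!) * E_k, with c = q(q-1) V_inf t."""
    return sum((-1) ** (r - k) * c**r / (factorial(k) * factorial(r - k)) * e(k)
               for k in range(r + 1))


def laurent_series(e, c, n, terms=40):
    """sum_r N^{-r} * (r-th coefficient); the r = 0 term is E_0."""
    return sum(derivative_coefficient(e, c, r) / n**r for r in range(terms))


# Hypothetical stand-ins for E_{t,k}(F): any bounded sequence works for this check.
e_values = lambda k: cos(k) / (1 + k)
q, v_inf, t, n = 3, 1.0, 0.5, 50
c = q * (q - 1) * v_inf * t  # so that Lambda * t = c / N
print(poisson_mixture(e_values, c / n), laurent_series(e_values, c, n))
```

The two printed values agree up to floating-point error, reflecting that grouping the terms of the expanded exponential by $r = k + i$ yields the coefficients of $1/N^r$.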

1.4 Wick theorem for interacting particle systems 

We say that two particles of the auxiliary system $\hat\xi^i$, $\hat\xi^j$, $i \ne j$, interact at $t$ if and only if
$\hat\xi^i$ jumps to $\hat\xi^j$ or $\hat\xi^j$ jumps to $\hat\xi^i$ at $t$. We say that a trajectory of the auxiliary system of
particles $\hat\xi^1, \dots, \hat\xi^q$ is exactly Wick-coupled on $[0, t]$ if and only if each particle of the system has
exactly one interaction with another particle between $0$ and $t$ (notice that the existence of
such trajectories requires $q$ to be even). We write $W_t$ for the set of Wick-coupled trajectories
on $[0, t]$; the set $W_t$ embeds into the set of trajectories with exactly $q/2$ rings on $[0, t]$.

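Theorem 1.4. If $F \in \mathcal{B}_0^{sym}(E^q)$ then, for $r < q/2$, $\partial^r\mathbb{Q}_{t,q}(F) = 0$. Moreover, for $q$ even,

$$\partial^{q/2}\mathbb{Q}_{t,q}(F) = \frac{(q(q-1)V_\infty t)^{q/2}}{(q/2)!}\,E_{t,q/2}(F).$$

In particular,

$$\limsup_{N \to \infty}\big|N^{q/2}\,\mathbb{Q}^N_{t,q}(F)\big| \le \frac{(q(q-1)V_\infty t)^{q/2}}{(q/2)!}\,\|F\|_\infty.$$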

Proof of Thm 1.4. Recall that $F$ is symmetric and belongs to $\mathcal{B}_0^{sym}(E^q)$ so that, for any
$i \le q$, $\int_E F(x_1, \dots, x_q)\,\gamma_t(dx_i) = 0$.

The proof follows then from Thm 1.3 together with the observation that, if there are
strictly less than $q/2$ rings, then at least one of the particles does not interact with the
others so that, if $k < \frac{q}{2}$, $E_{t,k}(F) = 0$. □

Corollary 1.5. With the assumptions of Thm 1.4 and $q$ even, we have:

$$\partial^{q/2}\mathbb{Q}_{t,q}(F) = \frac{q!}{(q/2)!}\,(V_\infty t)^{q/2}\,\mathbb{E}\big(F_e((\hat\xi_s^1, \dots, \hat\xi_s^q)_{0 \le s \le t}) \,\big|\, W_t\big).$$

If, furthermore, $F = (f^1 \otimes \dots \otimes f^q)_{sym}$ (with $f^1, \dots, f^q$ centered with respect to $\gamma_t$) then
we have the Wick-type expansion

$$\mathbb{E}\big(F_e((\hat\xi_s^1, \dots, \hat\xi_s^q)_{0 \le s \le t}) \,\big|\, W_t\big) = \frac{2^{q/2}\,(q/2)!}{q!}\sum_{I_q \in \mathcal{I}_q}\prod_{\{i,j\} \in I_q} E'_{t,1}(f^i \otimes f^j)$$

where the sum is over the set $\mathcal{I}_q$ of partitions $I_q$ of $[q]$ into pairs, $i, j$ being paired if $\{i, j\} \in I_q$,
and where, for a function $G$ of two variables,

$$E'_{t,1}(G) := \mathbb{E}\Big(G(\hat\xi_t^1, \hat\xi_t^2)\exp\Big(-\int_0^t \big[V_s(\hat\xi_s^1) + V_s(\hat\xi_s^2)\big]\,ds\Big) \,\Big|\, \text{one ring on } [0, t]\Big)$$

with $\hat\xi^1, \hat\xi^2$ an auxiliary system of two particles (defined with the same rules as a general
auxiliary system of $q$ particles).

Proof. Indeed, from the enumeration of possible interactions between particles and the general
properties of Poisson processes, we have:

$$\mathbb{P}(W_t \,|\, q/2 \text{ rings on } [0, t]) = \frac{q!}{(q(q-1))^{q/2}}.$$

The first part of the Corollary follows.

The last part follows from the observation that the distributions of two particles (or
of blocks of particles) are independent conditionally on the assumption that they do not
interact (either directly or through interactions with other particles), so that quantities such
as $\mathbb{E}(F_e((\hat\xi_s^1, \dots, \hat\xi_s^q)_{0 \le s \le t}) \,|\, W_t)$ may be computed by disjoint integrations over blocks (pairs
in this particular case) of interacting particles. Since $\mathrm{Card}(\mathcal{I}_q) = \frac{q!}{2^{q/2}(q/2)!}$, the Corollary
follows. □
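The combinatorial probability used in this proof can be checked by enumeration. The sketch below assumes, as in the counting argument above, that conditionally on the number of rings the ringing (ordered) pairs are independent and uniform over the $q(q-1)$ ordered pairs, and that every ring counts as an interaction; the function names are ours.

```python
from fractions import Fraction
from itertools import product
from math import factorial


def wick_probability_by_enumeration(q):
    """Fraction of sequences of q/2 ordered pairs (i, j), i != j, touching every particle exactly once."""
    ordered_pairs = [(i, j) for i in range(q) for j in range(q) if i != j]
    total = 0
    wick = 0
    for sequence in product(ordered_pairs, repeat=q // 2):
        total += 1
        touched = [particle for pair in sequence for particle in pair]
        # q slots in total: all distinct means each particle interacts exactly once.
        if len(set(touched)) == q:
            wick += 1
    return Fraction(wick, total)


def wick_probability_formula(q):
    """q! / (q (q - 1))^(q/2)."""
    return Fraction(factorial(q), (q * (q - 1)) ** (q // 2))


for q in (2, 4, 6):
    print(q, wick_probability_by_enumeration(q), wick_probability_formula(q))
```

The numerator $q!$ is the number of pair partitions, $q!/(2^{q/2}(q/2)!)$, times the $(q/2)!$ time orderings of the pairs and the $2^{q/2}$ orientations.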



1.5 Expansion of the normalized measure 

The purpose of the present section is to prove for the measures associated to the empirical
U-statistics $(\eta_t^N)^{\odot q}$ properties similar to the ones obtained in the unnormalized case. We
conclude the section with a Wick formula in this setting.
We set, for $F \in \mathcal{B}_b(E^q)$:

$$\mathbb{P}^N_{t,q}(F) := \mathbb{E}\big((\eta_t^N)^{\odot q}(F)\big).$$

Theorem 1.6. The sequence $(\mathbb{P}^N_{t,q})$ is differentiable up to any order.

Let us mention that an explicit formula (that we omit) for the derivatives $\partial^l\mathbb{P}_{t,q}$ follows
immediately from Formula (1.13) in our proof.

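Proof. Let us expand first $(\eta_t^N)^{\odot q}$ in terms of $(\gamma_t^N)^{\odot q}$. Let, in the following, $F$ be a bounded
symmetric function of $q$ variables. We first have:

$$(\eta_t^N)^{\odot q}(F) = (\gamma_t^N)^{\odot q}(F)\,\gamma_t^N(1)^{-q}.$$

We set $F_t := \frac{F}{\gamma_t(1)^q}$ and get:

$$(\eta_t^N)^{\odot q}(F) = (\gamma_t^N)^{\odot q}(F_t)\,\frac{1}{(1 - u_t^N)^q}$$

with

$$u_t^N := 1 - \frac{\gamma_t^N(1)}{\gamma_t(1)}.$$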

Recall the decomposition (that holds for any $m$, see [De09, Lemma 4.11]):



v * ' 0<fc<m l<fc<g v 4 

from which it follows that: 

( V ?) m (F) = (l?r q (F t ) (9-1 + *)*^ 



k-\-rn 



fc! 

0<fc<m 



(^) 0, (£)(«f) m E ci+m I Y^vF 

l<k<q 

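=: (1) + (2).

For an arbitrary function $f$ on $E$, we set: $\theta_t(f) := \frac{f - \eta_t(f)}{\gamma_t(1)}$. Then, recalling the equality
$\gamma_t(1) = \exp\big(-\int_0^t \eta_s(V_s)\,ds\big)$, we get

$$\frac{\gamma_t^N(1)}{\gamma_t(1)} = 1 - \int_0^t \big[\eta_u^N(V_u) - \eta_u(V_u)\big]\,\frac{\gamma_u^N(1)}{\gamma_u(1)}\,du = 1 - \int_0^t \gamma_u^N(\theta_u(V_u))\,du,$$

where the last identity follows from the rewriting

$$\frac{\gamma_u^N(1)}{\gamma_u(1)}\big[\eta_u^N(V_u) - \eta_u(V_u)\big] = \gamma_u^N\Big(\frac{V_u - \eta_u(V_u)}{\gamma_u(1)}\Big).$$

In particular, we get, for an arbitrary $k \ge 0$,

$$\Big(1 - \frac{\gamma_t^N(1)}{\gamma_t(1)}\Big)^k = \int_{(s_1, \dots, s_k) \in [0, t]^k}\prod_{i=1}^{k}\gamma_{s_i}^N(\theta_{s_i}(V_{s_i}))\,ds_1 \dots ds_k = k!\int_{0 \le s_1 < \dots < s_k \le t}\prod_{i=1}^{k}\gamma_{s_i}^N(\theta_{s_i}(V_{s_i}))\,ds_1 \dots ds_k.$$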



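We therefore have

$$(1) = \sum_{0 \le k < m}(q - 1 + k)_k\int_{0 \le s_1 < \dots < s_k \le t}\Big(\prod_{i=1}^{k}\gamma_{s_i}^N(\theta_{s_i}(V_{s_i}))\Big)(\gamma_t^N)^{\odot q}(F_t)\,ds_1 \dots ds_k$$
$$= \sum_{0 \le k < m}(q - 1 + k)_k\int_{0 \le s_1 < \dots < s_k \le t}\gamma_{s,t}^N\big(S_s^{k,t}(F)\big)\,ds_1 \dots ds_k \tag{1.10}$$

with $s := \{s_1, \dots, s_k\}$, $\gamma_{s,t}^N := \gamma_{s_1}^N \otimes \dots \otimes \gamma_{s_k}^N \otimes (\gamma_t^N)^{\odot q}$, $S_s^{k,t}(F) := \theta_{s_1}(V_{s_1}) \otimes \dots \otimes \theta_{s_k}(V_{s_k}) \otimes F_t$.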
We introduce the operators (for $G$ a function of $k + q$ variables, $j \le k$)

$$D_j G(x_1, \dots, x_{k+q}) = \frac{N - (k + q - j)}{N}\,G(x_1, \dots, x_{k+q}) + \frac{1}{N}\sum_{j+1 \le i \le k+q} G(x_1, \dots, x_{j-1}, x_i, x_{j+1}, \dots, x_{k+q}).$$

We then have, for any empirical measure $m(x) = \frac{1}{N}\sum_{1 \le i \le N}\delta_{x_i}$, and for any function $G$
of $k + q \le N$ variables and any measure $\mu$ on $E^{j-1}$,

$$\big(\mu \otimes m(x) \otimes m(x)^{\odot(k+q-j)}\big)(G) = \mu \otimes m(x)^{\odot(k+q-j+1)}(D_j G).$$
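The identity relating $m(x) \otimes m(x)^{\odot n}$ to $m(x)^{\odot(n+1)}$ can be checked numerically in the simplest case $j = 1$ (no measure $\mu$ in front), with $n = k + q - 1$. The sketch below is only illustrative: the operator is written with the weights $\frac{N - n}{N}$ and $\frac{1}{N}$, which is what the identity forces, and the sample and the function `G` are arbitrary.

```python
from itertools import permutations
from math import factorial, isclose


def u_measure(x, G, q):
    """m(x)^{⊙q}(G): average of G over ordered q-tuples of distinct particles."""
    n = len(x)
    falling = factorial(n) // factorial(n - q)
    return sum(G(*(x[i] for i in s)) for s in permutations(range(n), q)) / falling


def tensor_with_u_measure(x, G, n_sym):
    """(m(x) ⊗ m(x)^{⊙n})(G): first argument averaged over all particles, the others over injections."""
    return sum(u_measure(x, lambda *y, a=a: G(a, *y), n_sym) for a in x) / len(x)


def apply_D(G, n_sym, n_particles):
    """DG = (N - n)/N * G + (1/N) * sum_i G with its first argument replaced by the (i+1)-th one."""
    def DG(x0, *rest):
        kept = (n_particles - n_sym) / n_particles * G(x0, *rest)
        merged = sum(G(rest[i], *rest) for i in range(n_sym)) / n_particles
        return kept + merged
    return DG


x = [0.3, -1.2, 2.0, 0.7, 1.5]
G = lambda a, b, c: a * a * b + c * a - b * c
left = tensor_with_u_measure(x, G, 2)
right = u_measure(x, apply_D(G, 2, len(x)), 3)
print(left, right)
```

The first weight accounts for the case where the extra particle differs from the $n$ others, and the sum accounts for the $n$ possible coincidences.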



For $G$ a function of $q + k$ variables, $s \le t$ and $j \le k + q$, we then define the Markov
operator

$$R^j_{s,t}G(x_1, \dots, x_{k+q}) = \mathbb{E}\Big(G(x_1, \dots, x_{k+q-j}, \hat\xi_t^1, \dots, \hat\xi_t^j)\,e^{-\int_s^t (V_u(\hat\xi_u^1) + \dots + V_u(\hat\xi_u^j))\,du} \,\Big|\, \hat\xi_s^1 = x_{k+q-j+1}, \dots, \hat\xi_s^j = x_{k+q}\Big), \tag{1.11}$$

where the $\hat\xi^i$ are defined as before, except for the initial condition that reads now $\hat\xi_s^i = x_{k+q-j+i}$.
Reasoning as before (and using the auxiliary system as in Sect. 1.3), we get, for $s_k \le t$,
$H$ a function of $k + q$ variables and $\mu$ a measure on $E^{k-1}$,

$$\mathbb{E}\big(\mu \otimes \gamma_{s_k}^N \otimes (\gamma_t^N)^{\odot q}(H) \,\big|\, \mathcal{F}_{s_k}\big) = \mu \otimes \big(\gamma_{s_k}^N \otimes (\gamma_{s_k}^N)^{\odot q}\big)\big(R^q_{s_k,t}(H)\big) = \mu \otimes (\gamma_{s_k}^N)^{\odot(q+1)}\big(D_k R^q_{s_k,t}(H)\big),$$

where $\mathcal{F}_t$ stands for the natural filtration on the probability space underlying the particle
system. Here, we took advantage of the Markovian properties of the system and of Eq. (1.7).
So, by recurrence:

$$\mathbb{E}\big(\gamma_{s,t}^N(G)\big) = \eta_0^{\otimes(k+q)}\,R^{k+q}_{0,s_1}\,D_1\,R^{k+q-1}_{s_1,s_2} \cdots D_k\,R^q_{s_k,t}(G).$$

We proceed now as for the unnormalized measure and introduce still another more general
auxiliary particle system $(\tilde\xi_t^1, \dots, \tilde\xi_t^{k+q})_{t \ge 0}$. The system depends on $k$ (and $q$), but we do not
emphasize this dependency to abbreviate the notations. This should not create ambiguities
since the dependency on $k$ should be obvious from the context in the following formulas.

Let $S_1, S_2, \dots, S_k$ be the order statistics of $k$ uniform random variables in $[0, t]$ (we take
$k$ i.i.d. uniform variables in $[0, t]$ and sort them). The $S_i$ are independent of all the other
variables. We set $S_i := t$ for $k < i \le k + q$. The system (which agrees with the auxiliary
system $\hat\xi_t$ when $k = 0$) is defined as follows:

• The law of $(\tilde\xi_0^1, \dots, \tilde\xi_0^{k+q})$ is $\eta_0^{\otimes(k+q)}$.

• The particles diffuse according to $L_s$ and undergo the following jumps:

  – Each couple $(i, j) \in [q + k]^2$ has an exponential clock of parameter $V_\infty/N$. At a ringing
    time $s$: if $s > S_i$ or $s > S_j$, then nothing happens. Else,

    $$\begin{cases}\tilde\xi_s^i \leftarrow \tilde\xi_s^j & \text{with proba. } \dfrac{V_s(\tilde\xi_s^i)}{V_\infty},\\[2mm] \tilde\xi_s^i \leftarrow \tilde\xi_s^i & \text{with proba. } 1 - \dfrac{V_s(\tilde\xi_s^i)}{V_\infty}.\end{cases}$$

  – At $s = S_i$, we make the following replacement (with $U$ sampled uniformly in
    $\{i + 1, \dots, k + q\}$):

    $$\tilde\xi_s^i \leftarrow \tilde\xi_s^U \quad\text{with proba. } \frac{k - i + q}{N}.$$

    We say that there has been a parasite at $s$, so that the system has at most $k$
    parasites.

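With the notation

$$\tilde S_S^{k,t}(\tilde\xi_{S_1}^1, \dots) := S_S^{k,t}(F)\big(\tilde\xi_{S_1}^1, \dots, \tilde\xi_{S_k}^k, \tilde\xi_t^{k+1}, \dots, \tilde\xi_t^{k+q}\big)\exp\Big(-\sum_{j=1}^{k+q}\int_0^{S_j} V_u(\tilde\xi_u^j)\,du\Big),$$

the equation (1.10) gives

$$\mathbb{E}((1)) = \sum_{0 \le k < m}\frac{(k - 1 + q)_k}{k!}\,t^k\,\mathbb{E}\big(\tilde S_S^{k,t}(\tilde\xi_{S_1}^1, \dots, \tilde\xi_{S_k}^k, \tilde\xi_t^{k+1}, \dots, \tilde\xi_t^{k+q})\big)$$
$$= \sum_{0 \le k < m}\frac{(k - 1 + q)_k}{k!}\,t^k\,\mathbb{E}\Big(\sum_{r \ge 0}\frac{(\Lambda_S V_\infty/N)^r\,e^{-\Lambda_S V_\infty/N}}{r!}\,\mathbb{E}\big(\tilde S_S^{k,t}(\tilde\xi_{S_1}^1, \dots) \,\big|\, r \text{ rings}, S_1, \dots, S_k\big)\Big)$$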



with

$$\Lambda_S := q(q - 1)(t - S_k) + (q + 1)q\,(S_k - S_{k-1}) + \dots + (q + k)(q + k - 1)(S_1 - 0),$$

for which we will use the crude bound $\Lambda_S \le (q + k)^2\,t$.
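We set

$$E_{k,r,l} := \mathbb{E}\Big(\Lambda_S^l\,\mathbb{E}\big(\tilde S_S^{k,t}(\tilde\xi_{S_1}^1, \dots) \,\big|\, r \text{ rings}, S_1, \dots, S_k\big)\Big),$$

and notice that the parasites are sampled independently of the $S_i$ and of the ringing times,
so that:

$$E_{k,r,l} = \sum_{i=0}^{k}\mathbb{P}(i \text{ parasites})\,\mathbb{E}\Big(\Lambda_S^l\,\mathbb{E}\big(\tilde S_S^{k,t}(\tilde\xi_{S_1}^1, \dots) \,\big|\, i \text{ parasites}, r \text{ rings}, S_1, \dots, S_k\big)\Big).$$

The probability $\mathbb{P}(i \text{ parasites})$ depends on $t$, $q$ and $k$, but we omit the corresponding indices
to simplify the notation. The following lemma is crucial:

Lemma 1.7. For $i + r < \frac{k}{2}$, we have:

$$D_{k,r,l,i} := \mathbb{E}\Big(\Lambda_S^l\,\mathbb{E}\big(\tilde S_S^{k,t}(\tilde\xi_{S_1}^1, \dots) \,\big|\, i \text{ parasites}, r \text{ rings}, S_1, \dots, S_k\big)\Big) = 0,$$

so that:

$$E_{k,r,l} = \sum_{i = (\lceil k/2\rceil - r)^+}^{k}\mathbb{P}(i \text{ parasites})\,D_{k,r,l,i},$$

where $\lceil x\rceil$ stands for $\inf\{k \in \mathbb{N} : k \ge x\}$.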

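Indeed, for $i + r < \frac{k}{2}$, at least one of the $\tilde\xi^j$, $j \le k$, does not interact with the other particles,
so that, by Fubini's lemma, $\theta_{S_j}(V_{S_j})(\tilde\xi_{S_j}^j)\,e^{-\int_0^{S_j} V_u(\tilde\xi_u^j)\,du}$ can be integrated independently.

However, conditionally on the assumption that $\tilde\xi^j$ does not interact with the other particles,
we have:

$$\mathbb{E}\Big(\theta_{S_j}(V_{S_j})(\tilde\xi_{S_j}^j)\,e^{-\int_0^{S_j} V_u(\tilde\xi_u^j)\,du}\Big) = \gamma_{S_j}\big(\theta_{S_j}(V_{S_j})\big) = 0,$$

and the lemma follows.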

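We then have (by developing the exponential in $\mathbb{E}((1))$)

$$\mathbb{E}((1)) = \sum_{0 \le k < m}\frac{(k + q - 1)_k}{k!}\,t^k\sum_{r \ge 0}\sum_{i \ge 0}\frac{(-1)^i}{r!\,i!}\Big(\frac{V_\infty}{N}\Big)^{r+i}\sum_{j = (\lceil k/2\rceil - r)^+}^{k}\mathbb{P}(j \text{ parasites})\,D_{k,r,r+i,j}. \tag{1.12}$$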

By the construction of the auxiliary system of particles, $\mathbb{P}(j \text{ parasites}) = O(\frac{1}{N^j})$. For further
use, the explicit formula is:

$$\mathbb{P}(j \text{ parasites}) = \sum_{1 \le k_1 < \dots < k_j \le k}\frac{q + k_1 - 1}{N} \cdots \frac{q + k_j - 1}{N}\prod_{i \in [k] \setminus \{k_1, \dots, k_j\}}\Big(1 - \frac{q + i - 1}{N}\Big),$$

so that there exist coefficients $(a_l)$ (depending on $k$ and $q$; as usual we omit the corresponding
indices for notational simplicity) such that

$$\mathbb{P}(j \text{ parasites}) = \sum_{l = j}^{k}\frac{a_l}{N^l}.$$

Because of the condition $i + r < \frac{k}{2}$ in Lemma 1.7, it follows that the coefficient of any
$\frac{1}{N^p}$ in the expansion of $\mathbb{E}((1))$ is a finite sum of coefficients which are independent of $N$.
This is essentially all that we need in order to prove Thm. 1.6, provided we are able
(Step 1 below) to give a suitable upper bound for the remainder terms in the expansion of
$\mathbb{E}((1))$ at a given order, and (Step 2) to give a suitable upper bound to $\mathbb{E}((2))$. This is the
(cumbersome) purpose of the following conclusion to the proof of Thm. 1.6, which we
include for completeness' sake but which the reader may skip.

Step 1: Upper bound for the remainder terms in the expansion of $\mathbb{E}((1))$ at the order $p$,
assuming that $m$ is even and $m = 2p$.

Notice that for an arbitrary bounded function $F$,

$$\|F_t\|_\infty \le \|F\|_\infty\,e^{tqV_\infty},$$
$$\|S_s^{k,t}(F)\|_\infty \le 2^k\,V_\infty^k\,e^{t(k+q)V_\infty}\,\|F\|_\infty,$$

and so

$$|E_{k,r,r+i}| \le \big(t(q + k)^2\big)^{r+i}\,2^k\,V_\infty^k\,e^{t(k+q)V_\infty}\,\|F\|_\infty.$$

In Formula (1.12) for the expectation $\mathbb{E}((1))$, the sum of the terms with a factor $1/N^{r+i}$,
$r + i \ge p$, is, in absolute value, bounded as follows (we use here: $\forall x \ge 0$, $\sum_{i \ge j}\frac{x^i}{i!} \le \frac{x^j}{j!}\,e^x$)



E 

0<k<2p 



k\ 



* E 

0<fc<2p 
1 

X 



(p-r) 



{k + q- l) k 
k\ 

Vootiq + fc) 
N 



E v oo \ " 

j\r r r! 

r=0 ' i>(p— r)+ 

p-1 



N 



Ek. 



r,r-\-i 



El^T 2fc ^e t(fc+9)y -||^||c 



N r r\ 



r=0 
2 \ P~ r 



exp 



Vcotjq + k'y 
N 



2\r 



(t(k+ q y) 



+ E ^h^V^^lWUexp ( V <~*(9 + k ) a ) (/( , /, 



N 



< 



- E 

NP 



(k + q-l) k 



NP ^ k\ 

0<k<2p 

-p-l 

J2VP, +k 2 k e t(k+q)v ^\\F\\ 

_r=Q 



1 



(p — r)!r! 



+2 



t i+9fre e v4H,)|| 



FW^e 1 



(t{q + k) 2 fe^r 



= °^p ) - 



Since, by Formula (1.13), $\mathbb{E}((1))$ is the sum of this term with a polynomial in $\frac{1}{N}$, the proof of
Step 1 is concluded.

Step 2: Upper bound to $\mathbb{E}((2))$.

Since $\gamma_t(1) = e^{-\int_0^t \eta_u(V_u)\,du}$ and $\gamma_t^N(1) = e^{-\int_0^t \eta_u^N(V_u)\,du}$, we get:



|E((2))| < HFlleoe*'^ c£%{e tv - + 1)" n«) 2p ) . 

\l<k<q J 

We have, with the same auxiliary system of the $\tilde\xi$'s as before but with $q = 0$, $k = 2p$ and
with $S_1, \dots, S_{2p}$ the order statistics of $2p$ uniform variables on $[0, t]$,

E((«f) 2p ) 

We get (with, now, A s = (2p)(2p- l)5i + (2p- l)(2p- 2)(5 2 - Si) + • • • + 1 X (5 2p - S 2p _i)) 
+°° / A t/ \ r -i 

E((uf ) 2 *) = E(£ ^ ±e-W N E(9 Sl (V Sl ) ® • • • ® e S2p (V S2p ) 
= E (£ f J ^e- KsV ™' N Hi parasites) 

r=0 ^ ' ' i=0 

x E(9 Sl (V 3l ) ® • • • ® #s 2p , • • • , l^) 

e -/ Si -/o S2p v il (a 1 )...vi p (e ai> )«j« l ...«te 3P | r ringSj . parasiteS;5lj . . . )52p)) . 



We have 



2p- 1 



N 



P(i parasites) < C^,^ 

and know that, if r + i < p 

>S 2p 



E(0 Sl (F Sl ) ® • • • ® S2p (^S 2p ) , • . • , fsl ) 



-Jo 1 -So 2 " V ui(iL)-Vu v i& )du 1 ...du 2p , . . „ „ ss 

e p^>" 2p ' p |r rings, « parasites, i> 2 p)) = 



So that 

+2?(t(V 2 <- " """ J 

Np 



r=0 i=(p— r)+ 



and 

|E((2))| < ^-x||F|U e ^ ( £ ^(e'^+l)' 

\l<k<q 

2p-l , 

2 C| p _ 1 (2p-l) i e 2 ^2 2 ^ = 0(— ), 



x e t(2p)^o« 



or 



i=0 

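and the proof of the Thm is complete. □

Corollary 1.8. If $F \in \mathcal{B}_0^{sym}(E^q)$, then

• $\partial^l\mathbb{P}_{t,q}(F) = 0$, $\forall l < q/2$;

• for $q$ even, $\partial^{q/2}\mathbb{P}_{t,q}(F)$ simplifies to

$$\partial^{q/2}\mathbb{P}_{t,q}(F) = \frac{(q(q-1)V_\infty t)^{q/2}}{(q/2)!}\,\mathbb{E}\big((F_t)_e(\hat\xi^1, \dots, \hat\xi^q) \,\big|\, q/2 \text{ rings}\big).$$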

Proof of Corollary 1.8. Indeed, the same argument as in the proof of Lemma 1.7 shows
that, for a centered function $F \in \mathcal{B}_0^{sym}$, the terms $D_{k,r,r+i,j}$ vanish for $j + r < \frac{k+q}{2}$ (this is
because at least one of the $\tilde\xi^j$, $j \le k + q$, does not interact with the other particles if there
are less than $\frac{k+q}{2}$ rings and parasites). It follows that, if $q$ is even (resp. odd), $\frac{q}{2}$ (resp.
$\frac{q+1}{2}$) is the smallest power of $\frac{1}{N}$ appearing in the expansion of $\mathbb{E}((1))$ for $m > q$ and that
the contribution of (2) is negligible with respect to $1/N^{q/2}$, from which the first part of the
Corollary follows.

The same reasoning shows that, for $q$ even, only the case $k = i = j = 0$ and $r = \frac{q}{2}$
contributes to the coefficient of $(\frac{1}{N})^{q/2}$. As noticed before, for $k = 0$, the systems $(\tilde\xi)$ and $(\hat\xi)$
have the same law. The second statement follows.

2 First results on asymptotic normality

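The empirical measure $\eta_t^N$ converges weakly to $\eta_t$: for all bounded $f$, $\eta_t^N(f) \to \eta_t(f)$ as $N \to \infty$.

The Wick theorem (Cor. 1.8) allowed us to improve on this convergence result. Namely, we
know from the previous section that

• For $q$ odd and $F \in \mathcal{B}_0^{sym}(E^q)$: $N^{q/2}\,\mathbb{E}\big((\eta_t^N)^{\odot q}(F)\big) \to 0$ as $N \to \infty$,

• For $q$ even and $F \in \mathcal{B}_0^{sym}(E^q)$: $N^{q/2}\,\mathbb{E}\big((\eta_t^N)^{\odot q}(F)\big) \to \Delta_{q/2}(F)$ as $N \to \infty$,

where $\Delta_{q/2}$ is a shortcut for the signed measure $\partial^{q/2}\mathbb{P}_{t,q}$ computed in Corollary 1.8.

Furthermore, for $F = (f_1 \otimes \dots \otimes f_q)_{sym}$, with $f_i \in \mathcal{B}_0(E)$, this formula simplifies to: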

a 9/2 (f) = e n ^x(m)=e n 

where W t := ^E' t>v 

A striking consequence of this Wick theorem for interacting particle systems and their
U-statistics is that it leads immediately to an explicit result of asymptotic normality for the
vector-valued random measures $(\eta_t^N)^{\otimes q}$. Notice that the same properties hold for the system
studied in [Rub09] and so we have the same asymptotic normality result for this system.
More precisely:

Theorem 2.1. $\forall f_1, \dots, f_q \in \mathcal{B}_0(E)$,

$$N^{1/2}\big(\eta_t^N(f_1), \dots, \eta_t^N(f_q)\big) \xrightarrow[N \to +\infty]{\text{law}} \mathcal{N}(0, K)$$

with $K(i, j) = \eta_t(f_i f_j) + \mathbb{E}(W_t(f_i \otimes f_j))$.
Proof. For any $u_1, \dots, u_q$, we have:

N (y fi, iu lh H 1" ^g/? 



E exp Afyf log 1 



AT 



E ( CX P ( E { -^N l - k l 2 n ? {{iuxh + ■ ■■iu q f q ) k ) 

,k>l 



~ E(exp(VJV(i«i»?r(/i) + ••• +iu,% A (/,)))) 
x exp (^-^Vt((uifi + ■■■ +Ugf q ) 2 ))^j , 

where the last equivalence follows from the differentiability of the measures $\mathbb{P}^N_{t,q}$.

We also have
E ( exp [Nr^ ( log ( 1 



N I 1 / n , + ■ ■■iUqfq 



N 



AT 



E E ^ E 

\0<fc<AT l<ii,...Js 



i Uj 1 . . . Uj k 



<9 l<ii<—«fc<JV 



\0<fc<A' l<h,-,jh<Q 

N ^ E (-D fc/2 E ^r^E n 

fe>0,fc even l<ji, — )jfc<9 Ih&h {a,b}el h 

= E 2^(1/2)! E %' 1 ---% fe E(W t (/ J - 1 ®/ i2 ))...E(W i (/ J - fe _ 1 ®/ j J) 

fc>0,fc even ^ ' l<ji,---,jk<q 

fc/2 

E olT^V f E 



A;>0.A; even 



exp 



1<31j33<9 / 



2 

The Theorem follows. □ 

3 Convergence of empirical U-statistics.
3.1 Hoeffding's decomposition 

We want here to study the convergence under fairly general assumptions of a sequence of
empirical U-statistics $(\eta_t^N)^{\odot q}(F)$ when $N \to +\infty$, where the $\eta_t^N$ are empirical measures on
some space $E$ and $F$ is a bounded symmetrical function on $E^q$.

Let us first write an analog of Hoeffding's decomposition (cf. [Lee90, de99]). We fix
some measure $\eta_t$. For $1 \le k \le q$ and any $F$ bounded symmetrical on $E^q$, we define

$$F^{(k)}(x_1, \dots, x_k) = \int_{E^{q-k}} F(x_1, \dots, x_k, x_{k+1}, \dots, x_q)\,\eta_t^{\otimes(q-k)}(dx_{k+1}, \dots, dx_q).$$

Notice that $F^{(q)} = F$ and that, for an arbitrary $k$, $F^{(k)}$ is symmetrical on $E^k$. We define
recursively (the function $F$ being fixed; for notational simplicity, we do not stress the
dependency of $\theta, h^{(1)}, \dots$ on $F$ in the following formulas):

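$$\theta = \int_{E^q} F(x_1, \dots, x_q)\,\eta_t^{\otimes q}(dx_1, \dots, dx_q),$$

$$h^{(1)}(x_1) = F^{(1)}(x_1) - \theta,$$

$$h^{(k)}(x_1, \dots, x_k) = F^{(k)}(x_1, \dots, x_k) - \theta - \sum_{j=1}^{k-1}\sum_{(k,j)} h^{(j)},$$

where $\sum_{(k,j)} h^{(j)}$ is an abbreviation for $\sum_{1 \le i_1 < \dots < i_j \le k} h^{(j)}(x_{i_1}, \dots, x_{i_j})$.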
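The recursion can be made concrete on a finite state space, where integrals against $\eta_t$ are finite sums. The sketch below treats only $q = 2$, with a hypothetical probability vector `eta` and a hypothetical symmetric function `F`, and checks in exact arithmetic that $h^{(1)}$ is centered and that $h^{(2)}$ integrates to zero in each variable.

```python
from fractions import Fraction
from itertools import product


def hoeffding_q2(eta, F):
    """Return (theta, h1, h2) for a symmetric F on E^2, with E = {0, ..., len(eta) - 1}."""
    states = range(len(eta))
    theta = sum(eta[x] * eta[y] * F(x, y) for x, y in product(states, repeat=2))
    # F^(1)(x) integrates out the second variable against eta.
    F1 = [sum(eta[y] * F(x, y) for y in states) for x in states]
    h1 = [F1[x] - theta for x in states]
    # h^(2) = F^(2) - theta - sum over the one-element subsets of {1, 2} of h^(1).
    h2 = {(x, y): F(x, y) - theta - h1[x] - h1[y] for x, y in product(states, repeat=2)}
    return theta, h1, h2


eta = [Fraction(1, 2), Fraction(1, 3), Fraction(1, 6)]  # hypothetical reference measure
F = lambda x, y: Fraction((x - y) ** 2 + x * y)          # hypothetical symmetric kernel
theta, h1, h2 = hoeffding_q2(eta, F)
print(theta, h1)
print([sum(eta[y] * h2[(x, y)] for y in range(3)) for x in range(3)])
```

The last printed list contains only zeros, which is the degeneracy property that makes the higher-order terms of the decomposition negligible.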

The $h^{(k)}$ are bounded and symmetrical. They are constructed so as to be centered and
to satisfy, for $l < k$, $(h^{(k)})^{(l)} = 0$ (that is, integrating $h^{(k)}$ against $\eta_t$ in any of its
variables gives zero), see e.g. [Lee90, Sect. 1.6].

3.2 Asymptotic normality for empirical U-statistics.

As for classical U-statistics, the Hoeffding decomposition can be used to study the
convergence of the empirical U-statistics $(\eta_t^N)^{\odot q}$. Notice first that, $\forall N$,

\[
(\eta_t^N)^{\odot q}(F) = (\eta_t^N)^{\odot q}(h^{(q)}) + (\eta_t^N)^{\odot q}\Big(\sum_{j=1}^{q-1}\sum_{(q,j)}h^{(j)}\Big)+\theta . \qquad (3.1)
\]

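Notice that, $\forall\, 1 \le j \le q-1$,

\[
\sum_{(q,j)}h^{(j)} = \frac{1}{j!}\sum_{a\in\langle j,q\rangle}h^{(j)}(x_{a(1)},\dots,x_{a(j)}) .
\]

So, with the notations of the first sections:

\[
\begin{aligned}
(\eta_t^N)^{\odot q}\Big(\sum_{(q,j)}h^{(j)}\Big)
&= \frac{1}{j!}\sum_{a\in\langle j,q\rangle}\frac{1}{(N)_j}\sum_{a'\in\langle j,N\rangle}h^{(j)}\big(\xi_t^{a'(1)},\dots,\xi_t^{a'(j)}\big)\\
&= \binom{q}{j}(\eta_t^N)^{\odot j}(h^{(j)}) . \qquad (3.2)
\end{aligned}
\]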
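Identity (3.2) is purely combinatorial and can be verified by brute force on a small sample. The sketch below is only an illustration: `u_average` and `lifted` are hypothetical helper names, the sample is an arbitrary list of reals, and the kernel is passed as a plain Python function.

```python
import itertools
import math


def u_average(h, xs, j):
    """Average of h over all injections of {1, ..., j} into the sample xs."""
    vals = [h(*(xs[i] for i in a))
            for a in itertools.permutations(range(len(xs)), j)]
    return sum(vals) / len(vals)


def lifted(h, q, j):
    """Return (x_1, ..., x_q) -> sum over i_1 < ... < i_j of h(x_{i_1}, ..., x_{i_j})."""
    def g(*x):
        return sum(h(*(x[i] for i in idx))
                   for idx in itertools.combinations(range(q), j))
    return g


def check_identity(h, xs, q, j):
    """Difference between the two sides of (3.2) for the given sample."""
    lhs = u_average(lifted(h, q, j), xs, q)
    rhs = math.comb(q, j) * u_average(h, xs, j)
    return lhs - rhs
```

For any kernel `h` of `j` variables and any sample with at least `q` points, `check_identity` should return zero up to floating-point error.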
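Theorem 3.1. Suppose we have, for all $j \ge 2$ and $f \in \mathcal{B}_0^{sym}(E^j)$,

\[
\mathbb{E}\big(((\eta_t^N)^{\odot j}(f))^2\big) \le \frac{C}{N^j} , \qquad (3.3)
\]
for some constant $C$ depending on $t$, $j$, $\|f\|_\infty$. Assume further that, with $F$ as above,

\[
\sqrt{N}\,\eta_t^N(h^{(1)}) = \sqrt{N}\big(\eta_t^N(F^{(1)})-\theta\big) \xrightarrow[N\to+\infty]{\text{law}} \mathcal{N}(0,\sigma^2) . \qquad (3.4)
\]

Then

\[
\sqrt{N}\big((\eta_t^N)^{\odot q}(F)-\theta\big) \xrightarrow[N\to+\infty]{\text{law}} \mathcal{N}(0,q^2\sigma^2) . \qquad (3.5)
\]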
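The Theorem follows from Hoeffding's decomposition, together with the identity

\[
(\eta_t^N)^{\odot q}\Big(\sum_{(q,1)}h^{(1)}\Big) = q\,\eta_t^N(h^{(1)}) .
\]

Indeed, by (3.2) and (3.3), the terms of the decomposition involving $h^{(j)}$ with $j \ge 2$ are of order $N^{-j/2}$ in $L^2$, so they vanish in the limit after multiplication by $\sqrt{N}$.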

Corollary 3.2. For the Feynman-Kac particle systems of the first sections, we have: 

\[
\sqrt{N}\big((\eta_t^N)^{\odot q}(F)-\theta\big) \longrightarrow q^2\,\eta_t\big((F^{(1)})^2\big) .
\]

Indeed, notice first that Eq. (3.4) holds for Feynman-Kac particle systems, e.g. as a
corollary of Th. 2.1. The Corollary then follows from the following Lemma.

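Lemma 3.3. For the particle system described in the first sections we have, $\forall j$, $\forall f \in \mathcal{B}_0^{sym}(E^j)$,

\[
\mathbb{E}\big(((\eta_t^N)^{\odot j}(f))^2\big) \le \frac{C}{N^j} ,
\]
for some constant $C$ (depending on $t$, $\|f\|_\infty$).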






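Proof.

\[
\begin{aligned}
\mathbb{E}\big(((\eta_t^N)^{\odot j}(f))^2\big)
&= \frac{1}{((N)_j)^2}\sum_{a,b\in\langle j,N\rangle}\mathbb{E}\big(f(\xi_t^{a(1)},\dots,\xi_t^{a(j)})\,f(\xi_t^{b(1)},\dots,\xi_t^{b(j)})\big)\\
&= \frac{1}{((N)_j)^2}\sum_{k=0}^{j}\ \sum_{\substack{a,b\in\langle j,N\rangle\\ \#\mathrm{Im}(a)\cap\mathrm{Im}(b)=k}}\mathbb{E}\big(f(\xi_t^{a(1)},\dots,\xi_t^{a(j)})\,f(\xi_t^{b(1)},\dots,\xi_t^{b(j)})\big)\\
&= \frac{1}{((N)_j)^2}\sum_{k=0}^{j}\binom{j}{k}^2 k!\sum_{a\in\langle 2j-k,N\rangle}\mathbb{E}\big(f(\xi_t^{a(1)},\dots,\xi_t^{a(j)})\,f(\xi_t^{a(1)},\dots,\xi_t^{a(k)},\xi_t^{a(j+1)},\dots,\xi_t^{a(2j-k)})\big)\\
&= \frac{(N)_{2j}}{((N)_j)^2}\,\mathbb{E}\big((\eta_t^N)^{\odot 2j}(f\otimes f)\big)\\
&\quad+\sum_{k=1}^{j}\binom{j}{k}^2 k!\,\frac{(N)_{2j-k}}{((N)_j)^2}\,\mathbb{E}\big(f(\xi_t^{1},\dots,\xi_t^{j})\,f(\xi_t^{1},\dots,\xi_t^{k},\xi_t^{j+1},\dots,\xi_t^{2j-k})\big) .
\end{aligned}
\]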

Notice first that, because of the very definition of $(\eta_t^N)^{\odot 2j}$ (as an average over arbitrary
configurations of distinct systems of $2j$ particles), although $f \otimes f$ is not a symmetrical
function, $(\eta_t^N)^{\odot 2j}(f \otimes f) = (\eta_t^N)^{\odot 2j}(\mathrm{Sym}(f \otimes f))$, where $\mathrm{Sym}$ stands for the canonical
symmetrization map from $\mathcal{B}_0^{sym}(E^q) \otimes \mathcal{B}_0^{sym}(E^q)$ to $\mathcal{B}_0^{sym}(E^{2q})$. Therefore, by Corollary
1.8, $\exists C$, $\frac{(N)_{2j}}{((N)_j)^2}\,\mathbb{E}\big((\eta_t^N)^{\odot 2j}(f \otimes f)\big) \le \frac{C}{N^j}$. For $k \in \{1,\dots,j\}$ we set $g_k(x_1,\dots,x_{2j-k}) =
f(x_1,\dots,x_j)\,f(x_1,\dots,x_k,x_{j+1},\dots,x_{2j-k})$. We then have



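\[
\mathbb{E}\big(f(\xi_t^{1},\dots,\xi_t^{j})\,f(\xi_t^{1},\dots,\xi_t^{k},\xi_t^{j+1},\dots,\xi_t^{2j-k})\big)
= \mathbb{E}\big((\eta_t^N)^{\odot(2j-k)}(g_k)\big) .
\]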

This function has the particular feature that there are $a = 2j - 2k$ indexes $i$ such that
$\int_{E} g_k(x_1,\dots,x_{2j-k})\,\eta_t(dx_i) = 0$ (namely $i = k+1,\dots,2j-k$). As for the proof of
Corollary 1.8, we go back to the proof of Lemma 1.7. For the function $g_k$ defined above, the
terms $D_{k',r,r+1,j'}$ vanish for $j' + r < \frac{a}{2}$. So $j - k$ is the smallest power of $1/N$ appearing
in the Laurent-type expansion of $\mathbb{E}\big((\eta_t^N)^{\odot(2j-k)}(g_k)\big)$, and $\exists C'_k$, $\mathbb{E}\big((\eta_t^N)^{\odot(2j-k)}(g_k)\big) \le \frac{C'_k}{N^{j-k}}$.
This finishes the proof. □
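The proof groups the pairs of injections $(a,b)$ by the size $k$ of the overlap of their images. As a sanity check of the orders of magnitude, the sketch below counts these pairs by brute force and compares with the closed form $\binom{j}{k}^2\,k!\,(N)_{2j-k}$. This closed form is an elementary count stated here as an assumption of the sketch (it is not quoted from the text), and `falling` and `overlap_counts` are hypothetical helper names.

```python
import itertools
import math


def falling(n, k):
    """Falling factorial (n)_k = n (n-1) ... (n-k+1), with (n)_0 = 1."""
    return math.prod(range(n - k + 1, n + 1))


def overlap_counts(N, j):
    """counts[k] = number of pairs (a, b) of injections [j] -> [N]
    whose images share exactly k points."""
    counts = [0] * (j + 1)
    injections = list(itertools.permutations(range(N), j))
    for a in injections:
        for b in injections:
            counts[len(set(a) & set(b))] += 1
    return counts


def closed_form(N, j, k):
    """Assumed closed form for the number of pairs with overlap k."""
    return math.comb(j, k) ** 2 * math.factorial(k) * falling(N, 2 * j - k)
```

Since $(N)_{2j-k}/((N)_j)^2$ is of order $N^{-k}$, a bound of order $N^{-(j-k)}$ on the $k$-th expectation gives a total contribution of order $N^{-j}$, as in the Lemma.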



3.3 Wiener integrals expansions 

The convergence of symmetric statistics to multiple Wiener integrals has been studied
intensively, see e.g. the seminal [DM83] or the account of the classical theory in [Lee90]. We
are interested here in proving that the main statements of the theory still hold for
empirical U-statistics and focus on a particular example, namely the one of a symmetrical and
bounded kernel constructed as a product of centered functions.

From the technical point of view, the key difference with the classical theory of limit
distributions for U-statistics, which is based on i.i.d. assumptions, lies in the existence of interactions
between the particles of the system. Taking these interactions into account amounts in
practice to replacing the central limit theorem by the analogous statement for interacting particle
systems, namely Thm. 2.1.

Let us consider a kernel (a symmetrical bounded function on $E^q$) $F$ of the form $F =
(f_1 \otimes \dots \otimes f_q)_{sym}$ with $f_i \in \mathcal{B}_0(E)$. Then $F^{(k)} = 0$ for $k = 1,\dots,q-1$, and we expect
$N^{q/2}(\eta_t^N)^{\odot q}(F)$ to converge in law. This is indeed the case.

Let us start with some classical results (see e.g. [RW97] for a systematical approach by
means of the poset of partitions). For an empirical signed measure $m(x) = \sum_{1\le i\le N}\delta_{x_i}$ we
first have the formula of Rubin and Vitale ([DM83], [Lee90] p. 85):

\[
m^{\odot q}(F) = \sum_{V}\ \prod_{v\in V}(-1)^{|v|-1}(|v|-1)!\,m(f_v) ,
\]

where $f_v$, $v = \{v_1,\dots,v_k\}$, stands for $f_{v_1}\cdots f_{v_k}$ and $V$ runs over the set of partitions of $[q]$
into disjoint subsets.
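The partition formula can be checked by brute force on small samples. The sketch below is only illustrative and makes two assumptions: $m^{\odot q}(F)$ is taken to be the sum of $f_1(x_{i_1})\cdots f_q(x_{i_q})$ over pairwise distinct indices, and the functions are stored as a table `values[j, i]` holding $f_j(x_i)$. The names `set_partitions`, `distinct_sum` and `partition_sum` are hypothetical.

```python
import itertools
import math

import numpy as np


def set_partitions(items):
    """Yield every partition of the list `items` into non-empty blocks."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for part in set_partitions(rest):
        yield [[first]] + part
        for i in range(len(part)):
            yield part[:i] + [[first] + part[i]] + part[i + 1:]


def distinct_sum(values):
    """Sum over pairwise distinct indices of prod_j values[j, i_j]."""
    q, n = values.shape
    return sum(math.prod(values[j, idx[j]] for j in range(q))
               for idx in itertools.permutations(range(n), q))


def partition_sum(values):
    """Right-hand side: sum over partitions V of prod over blocks v of
    (-1)^(|v|-1) (|v|-1)! m(f_v), with m(f_v) = sum_i prod_{j in v} f_j(x_i)."""
    q, _ = values.shape
    total = 0.0
    for part in set_partitions(list(range(q))):
        term = 1.0
        for block in part:
            m_fv = values[block].prod(axis=0).sum()
            term *= (-1) ** (len(block) - 1) * math.factorial(len(block) - 1) * m_fv
        total += term
    return total
```

For $q = 2$ the formula reduces to $\sum_{i\neq j} f_1(x_i)f_2(x_j) = m(f_1)\,m(f_2) - m(f_1f_2)$, which is a quick way to read the signs.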

Assume now that $m(x) = N\cdot\eta_t^N =: m_N$. For $|v| = 1$, $N^{-1/2}m_N(f_v)$ converges in law
to $\mathcal{N}\big(0, \eta_t(f_v^2) + \mathbb{E}(W_t(f_v \otimes f_v))\big)$. For $|v| \ge 2$, and since $\eta_t^N$ converges to $\eta_t$, $N^{-1}m_N(f_v)$
converges to $\eta_t(f_v)$.

Let us consider now a partition $V$ of $[q]$ with $j_1$ sets with 1 element, $j_2$ sets with 2
elements, ..., $j_q$ sets with $q$ elements. We notice that $j_1 + 2j_2 + 2j_3 + \dots + 2j_q < j_1 + 2j_2 +
\dots + qj_q = q$, except if $j_3 = \dots = j_q = 0$ (and then the two terms are equal). In particular,
$N^{-q/2}\prod_{v\in V}(-1)^{|v|-1}(|v|-1)!\,m_N(f_v)$ converges in probability to 0 if $j_3 + \dots + j_q > 0$. By
Slutsky's theorem it follows that, if $N^{-q/2}m_N^{\odot q}(F) = N^{q/2}(\eta_t^N)^{\odot q}(F)$ converges in law, it
converges to the same limit as $N^{-q/2}\sum_{V}\prod_{v\in V}(-1)^{|v|-1}(|v|-1)!\,m_N(f_v)$, where the sum is restricted
now to partitions $V$ of $[q]$ with $j_3 = \dots = j_q = 0$. The next theorem follows.

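Theorem 3.4. With the above assumptions, we have the convergence in law:

\[
N^{q/2}(\eta_t^N)^{\odot q}(F) \longrightarrow \sum_{k=0}^{[q/2]}(-1)^k\sum_{1\le i_1<\dots<i_{q-2k}\le q} I(f_{i_1})\cdots I(f_{i_{q-2k}})\sum_{J}\eta_t(f_{j_1}f_{j'_1})\cdots\eta_t(f_{j_k}f_{j'_k}) .
\]

Here, $J$ runs over the partitions of $[q] - \{i_1,\dots,i_{q-2k}\}$ in ordered pairs $J_1 = \{j_1,j'_1\},\dots,J_k =
\{j_k,j'_k\}$ (where $j_i < j'_i$ and $j_1 < \dots < j_k$). Besides, $[x]$ stands for the integer part of a real
number $x$ (the highest integer less than or equal to $x$). The $I(f_i)$, $i = 1,\dots,q$, are Wiener integrals
of the functions $f_i$. They form a Gaussian family with $\mathbb{E}[I(f_i)] = 0$ and $\mathbb{E}[I(f_i)I(f_j)] =
\eta_t(f_if_j) + \mathbb{E}(W_t(f_i \otimes f_j))$.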

Corollary 3.5. In the particular case $F = f^{\otimes q}$ with $\eta_t(f^2) = 1$, we get the convergence in
law:

\[
N^{q/2}(\eta_t^N)^{\odot q}(F) \longrightarrow H_q(I(f)) ,
\]

where $H_k$ stands for the $k$-th Hermite polynomial.
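The link with Hermite polynomials can be checked on coefficients. The sketch below assumes the "probabilists'" normalization, as implemented by NumPy's `hermite_e` module, and builds the polynomial whose coefficient of $x^{q-2k}$ is $(-1)^k$ times the number of partitions of $[q]$ into $q-2k$ singletons and $k$ pairs; `pairing_expansion_coeffs` is a hypothetical helper name.

```python
import math

import numpy as np
from numpy.polynomial import hermite_e


def pairing_expansion_coeffs(q):
    """Coefficients (lowest degree first) of
    sum_k (-1)^k * q! / (2^k (q-2k)! k!) * x^(q-2k)."""
    coeffs = np.zeros(q + 1)
    for k in range(q // 2 + 1):
        # Number of partitions of [q] into q-2k singletons and k pairs.
        count = math.factorial(q) // (2 ** k * math.factorial(q - 2 * k) * math.factorial(k))
        coeffs[q - 2 * k] = (-1) ** k * count
    return coeffs


def hermite_coeffs(q):
    """Power-basis coefficients of the q-th probabilists' Hermite polynomial."""
    return hermite_e.herme2poly([0] * q + [1])
```

For instance, with $q = 3$ both functions give the coefficients of $x^3 - 3x$.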

Indeed, in that case, since the number of partitions of $[q]$ with $q - 2k$ singletons and $k$
pairs is $\frac{q!}{2^k (q-2k)!\,k!}$, the Theorem simplifies to a convergence in law of $N^{q/2}(\eta_t^N)^{\odot q}(F)$ to

\[
\sum_{k=0}^{[q/2]}(-1)^k\frac{q!}{2^k (q-2k)!\,k!}\,I(f)^{q-2k} ,
\]
where one recognizes the expansion of the $q$-th ("probabilistic") Hermite polynomial: